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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Bradfield, Christopher A. 

Dolwick, Kristin M. 
Carver, Lucy A. 

(ii) TITLE OF INVENTION: Ah Receptor cDNAs and Genetically 
Engineered Cells for Detecting Agonists to the Ah 

Receptor 

(iii) NUMBER OF SEQUENCES: 36 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Tilton, Fallon, Lungmus & Chestnut 

(B) STREET: 100 South Wacker Drive, Suite 960 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: USA * 

(F) ZIP: 60606-4002 l' 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Tilton, Timothy L. 

(B) REGISTRATION NUMBER: 16,926 

(C) REFERENCE/ DOCKET NUMBER: NU-9207-CIP 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (312) -456-8000 

(B) TELEFAX: (312) -456-7776 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2415 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG AGC AGC GGC GCC AAC ATC ACC TAT GCC AGC CGC AAG CGG CGC AAG 
48 

Met Ser Ser Gly Ala Asn lie Thr Tyr Ala Ser Arg Lys Arg Arg Lys 
1 5 10 15 

CCG GTG CAG AAA ACA GTA AAG CCC ATC CCC GCT GAA GGA ATT AAG TCA 
96 

Pro Val Gin Lys Thr Val Lys Pro lie Pro Ala Glu Gly lie Lys Ser 
20 25 30 

AAT CCT TCT AAG CGA CAC AG A GAC CGG CTG AAC ACA GAG TTA GAC CGC 

144 * 
Asn Pro Ser Lys Arg His Arg Asp Arg Leu Asn Thr Glu Leu Asp Arg 
35 40 ' 45 

CTG GCC AGC CTG CTG CCC TTC CCG CAA GAT GTT ATT AAT AAG CTG GAC 
192 

Leu Ala Ser Leu Leu Pro Phe Pro Gin Asp Val lie Asn Lys Leu Asp 
50 55 60 

AAA CTC TCT GTT CTT AGG CTC AGC GTC ACG TAC CTG AGG GCC AAG AGC 
240 

Lys Leu Ser Val Leu Arg Leu Ser Val Thr Tyr Leu Arg Ala Lys Ser 
65 70 75 80 

TTC TTT GAT GTT GCA TTA AAG TCC ACC CCT GCT GAC AGA AAT GGA GGC 
288 

Phe Phe Asp Val Ala Leu Lys Ser Thr Pro Ala Asp Arg Asn Gly Gly 

85 90 95 

CAG GAC CAG TGT AGA GCA CAA ATC AGA GAC TGG CAG GAT TTG CAA GAA 
336 

Gin Asp Gin Cys Arg Ala Gin lie Arg Asp Trp Gin Asp Leu Gin Glu 
100 105 110 

GGA GAG TTC TTG TTA CAG GCG CTG AAT GGC TTT GTG CTG GTT GTC ACA 
384 

Gly Glu Phe Leu Leu Gin Ala Leu Asn Gly Phe Val Leu Val Val Thr 
115 120 125 

GCA GAT GCC TTG GTC TTC TAT GCT TCC TCC ACT ATC CAA GAT TAC CTG 
432 



Ala Asp Ala Leu Val Phe Tyr Ala Ser Ser Thr lie Gin Asp Tyr Leu 
130 135 140 

GGC TTT CAG CAG TCT GAT GTC ATC CAT CAG AGC GTA TAT GAG CTC ATC 
480 

Gly Phe Gin Gin Ser Asp Val lie His Gin Ser Val Tyr Glu Leu lie 

145 150 155 160 

CAT ACA GAA GAC CGG GCG GAA TTC CAG CGC CAG CTT CAC TGG GCT CTA 
528 

His Thr Glu Asp Arg Ala Glu Phe Gin Arg Gin Leu His Trp Ala Leu 

165 170 175 

AAC CCA GAC TCT GCA CAA GGA GTG GAC GAA GCC CAT GGC CCT CCA CAG 
576 

Asn Pro Asp Ser Ala Gin Gly Val Asp Glu Ala His Gly Pro Pro Gin 

180 185 190 

GCA GCA GTC TAT TAT ACC CCA GAC CAG CTT CCT CCA GAG AAC GCT TCT 
624 

Ala Ala Val Tyr Tyr Thr Pro Asp Gin Leu Pro Pro Glu Asn Ala Ser 
195 200 i 205 

I' 

TTC ATG GAG AGG TGC TTC AGG TGC CGG CTG AGG ITGC CTG CTG GAT AAT 
672 

Phe Met Glu Arg Cys Phe Arg Cys Arg Leu Arg Cys Leu Leu Asp Asn 
210 215 220 

TCA TCT GGT TTT CTG GCA ATG AAT TTC CAA GGG AGG TTA AAG TAT CTT 
720- 

Ser Ser Gly Phe Leu Ala Met Asn Phe Gin Gly Arg Leu Lys Tyr Leu 
225 230 235 240 

CAT GGA CAG AAC AAG AAA GGG AAG GAC GGA GCG CTG CTT CCT CCA CAA 
768 

His Gly Gin Asn Lys Lys Gly Lys Asp Gly Ala Leu Leu Pro Pro Gin 

245 250 255 

CTG GCT TTG TTT GCA ATA GCT ACT CCA CTT CAG CCA CCC TCC ATC CTG 
816 

Leu Ala Leu Phe Ala lie Ala Thr Pro Leu Gin Pro Pro Ser lie Leu 
260 265 270 

GAA ATT CGA ACC AAA AAC TTC ATC TTC AGG ACC AAA CAC AAG CTA GAC 
864 

Glu lie Arg Thr Lys Asn Phe lie Phe Arg Thr Lys His Lys Leu Asp 
275 280 285 

TTC ACA CCT ATT GGT TGT GAT GCC AAA GGG CAG CTT ATT CTG GGC TAT 
912 

Phe Thr Pro lie Gly Cys Asp Ala Lys Gly Gin Leu lie Leu Gly Tyr 
290 295 300 
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ACA GAA GTA GAG 
960 

Thr Glu Val Glu 
305 

GCT GCA GAC ATA 
1008 

Ala Ala Asp lie 



ACT GGA GAA AGT 

1056 
Thr Gly Glu Ser 
340 

CGC TGG AGG TGG 
1104 

Arg Trp Arg Trp 
355 

AGA CCA GAT TAC 
1152 

Arg Pro Asp Tyr 
370 

GGA CGA GAG CAT 

1200 
Gly Arg Glu His 
385 

GCT ACC GGA GAG 

1248 
Ala Thr Gly Glu 



ATA ATG GAT CCC 

1296 
lie Met Asp Pro 
420 

GAC TGG GCT CCC 
1344 

Asp Trp Ala Pro 
435 

TCT CTT ATG AGT 
1392 

Ser Leu Met Ser 
450 

CCT CCT TCA AGC 
1440 



CTG TGC ACA AGA 

Leu Cys Thr Arg 
310 

CTT CAC TGT GCA 

Leu His Cys Ala 
325 

GGC ATG ACA GTT 
Gly Met Thr Val 

GTC CAG TCC AAT 

Val Gin Ser Asn 
360 

ATC ATC GCC ACT 

He He Ala Thr 
375 

TTA CAG AAG CGA 

Leu Gin Lys Arg 
390 

GCT GTG TTG TAC 

Ala Val Leu Tyr 
405 

CTA CCA ATA CGC 
Leu Pro He Arg 

CAG TCA ACC CCA 

Gin Ser Thr Pro 
440 

GCC CTC ATC CAG 

Ala Leu He Gin 
455 

CCT GCG CTG TTA 



GGA TCG GGG TAC 

Gly Ser Gly Tyr 
315 

GAA TCC CAC ATC 

Glu Ser His He 
330 

TTC CGG CTT CTT 

Phe Arg Leu Leu 
345 

GCA CGC TTG ATT 
Ala Arg Leu He 

CAG AGA CCA CTG 

i 

Gin Arg Pro iLeu 
'380 

AGT ACG TCG CTG 

Ser Thr Ser Leu 
395 

GAG ATC TCC AGC 

Glu He Ser Ser 
410 

ACC AAA AGC AAC 

Thr Lys Ser Asn 
425 

AGT AAG GAT TCT 
Ser Lys Asp Ser 

CAG GAT GAG TCC 

Gin Asp Glu Ser 
460 

GAC AGC CAT TTT 



CAG TTC ATC CAT 

Gin Phe He His 
320 

CGC ATG ATT hhG 

Arg Met He Lys 
335 

GCA AAA CAC AGT 

Ala Lys His Ser 
350 

TAC AGA AAT GGA 

Tyr Arg Asn Gly 
365 

ACG GAT GAA GAA 
Thr Asp Glu Glu 

CCC TTC ATG TTT 

Pro Phe Met Phe 
400 

CCT TTC TCT CCC 

Pro Phe Ser Pro 
415 

ACT AGC AGG AAA 

Thr Ser Arg Lys 
430 

TTC CAC CCC AGT 

Phe His Pro Ser 
445 

ATC TAT CTG TGT 
He Tyr Leu Cys 

CTC ATG GGC TCC 
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Pro Pro Ser Ser Pro Ala Leu Leu Asp Ser His Phe Leu Met Gly Ser 
465 470 475 480 

GTG AGC AAG TGC GGG AGT TGG CAA GAC AGC TTT GCG GCC GCA GGA AGT 
1488 

Val Ser Lys Cys Gly Ser Trp Gin Asp Ser Phe Ala Ala Ala Gly Ser 

485 490 495 

GAG GCT GCG CTG AAA CAT GAG CAA ATT GGC CAT GCT CAG GAC GTG AAC 
1536 

Glu Ala Ala Leu Lys His Glu Gin lie Gly His Ala Gin Asp Val Asn 
500 505 510 

CTT GCA CTC TCT GGC GGC CCC TCA GAG CTC TTT CCG GAT AAT AAA' AAT 
1584 

Leu Ala Leu Ser Gly Gly Pro Ser Glu Leu Phe Pro Asp Asn Lys Asn 
515 520 525 

AAT GAC TTG TAC AGC ATC ATG AGG AAC CTT GGG ATT GAT TTT GAA GAT 
1632 

Asn Asp Leu Tyr Ser lie Met Arg Asn Leu Gly lie Asp Phe Glu Asp 
530 535 54i0 

ATC AGA AGC ATG CAG AAC GAG GAG TTC TTC AGA ACT GAC TCC ACC GCT 
1680 

lie Arg Ser Met Gin Asn Glu Glu Phe Phe Arg Thr Asp Ser Thr Ala 

545 550 555 ■ 560 

GCT GGT GAG GTT GAC TTC AAA GAC ATC GAC ATA ACG GAC GAA ATC CTG 
1728. 

Ala Gly Glu Val Asp Phe Lys Asp lie Asp lie Thr Asp Glu lie Leu 

565 570 575 

ACC TAC GTG CAG GAT TCC CTG AAC AAT TCA ACT TTG CTG AAC TCG GCT 
1776 

Thr Tyr Val Gin Asp Ser Leu Asn Asn Ser Thr Leu Leu Asn Ser Ala 
580 585 590 

TGC CAG CAG CAG CCT GTG ACT CAG CAC CTA AGC TGT ATG CTG CAG GAG 
1824 

Cys Gin Gin Gin Pro Val Thr Gin His Leu Ser Cys Met Leu Gin Glu 
595 600 605 

CGC CTG CAA CTA GAG CAA CAG CAA CAG CTT CAG CAG CCC CCG CCG CAG 
1872 

Arg Leu Gin Leu Glu Gin Gin Gin Gin Leu Gin Gin Pro Pro Pro Gin 
610 615 620 

GCT CTG GAG CCC CAG CAG CAG CTG TGT CAG ATG GTG TGC CCC CAG CAA 
1920 

Ala Leu Glu Pro Gin Gin Gin Leu Cys Gin Met Val Cys Pro Gin Gin 
625 630 635 640 
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GAT CTG GOT CCG AAG CAC ACG CAA ATC AAC GGC ACG TTT GCA AGT TGG 
1968 

Asp Leu Gly Pro Lys His Thr Gin lie Asn Gly Thr Phe Ala Ser Trp 

645 650 655 

AAC CCC ACC CCT CCC GTG TCT TTC AAC TGT CCC CAG CAG GAA CTA AAG 
2016 

Asn Pro Thr Pro Pro Val Ser Phe Asn Cys Pro Gin Gin Glu Leu Lys 
660 665 670 

CAC TAT CAG CTC TTT TCC AGC TTA CAG GGG ACT GCT CAG GAA TTT CCC 
2064 

His Tyr Gin Leu Phe Ser Ser Leu Gin Gly Thr Ala Gin Glu Phe Pro 
675 680 685 

TAC AAA CCA GAG GTG GAC AGT GTG CCT TAC ACA CAG AAC TTT GCT CCC 
2112 

Tyr Lys Pro Glu Val Asp Ser Val Pro Tyr Thr Gin Asn Phe Ala Pro 
690 695 .700 

TGT AAT CAG CCT CTG CTT CCA GAA CAT TCC AAG AGT GTG CAG TTG GAC 

2160 i 

Cys Asn Gin Pro Leu Leu Pro Glu His Ser Lys i^er Val Gin Leu Asp 

705 710 715 ' 720 

TTC CCT GGA AGG GAT TTT GAA CCG TCC CTG CAT CCC ACT ACT TCT AAT 
2208 

Phe Pro Gly Arg Asp Phe Glu Pro Ser Leu His Pro Thr Thr Ser Asn 

725 730 735 

TTA GAT TTT GTC AGT TGT TTA CAA GTT CCT GAA AAC CAA AGT CAT GGG 
2256 

Leu Asp Phe Val Ser Cys Leu Gin Val Pro Glu Asn Gin Ser His Gly 
740 745 750 

ATA AAC TCA CAG TCC GCC ATG GTC AGT CCT CAG GCA TAC TAT GCT GGG 
2304 

lie Asn Ser Gin Ser Ala Met Val Ser Pro Gin Ala Tyr Tyr Ala Gly 
755 760 765 

GCC ATG TCC ATG TAT CAG TGC CAG CCA GGG CCA CAG CGC ACC CCT GTG 
2352 

Ala Met Ser Met Tyr Gin Cys Gin Pro Gly Pro Gin Arg Thr Pro Val 
770 775 780 • 

GAC CAG ACG CAG TAC AGC TCT GAA ATT CCA GGT TCT CAG GCA TTC CTA 
2400 

Asp Gin Thr Gin Tyr Ser Ser Glu lie Pro Gly Ser Gin Ala Phe Leu 
785 790 795 800 

AGC AAG GTG CAG AGT TGAGGTGTTT TCAATGAAAC CTATTCGTCC GACTTGAGCA 
2455 
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Ser Lys Val Gin Ser 

805 

GCATTGGCCA CGCTGCTCAG ACCACTGGCC ATCTCCATCA CTGCGGAAGC CCGGCCTCTT 
2515 

CCCGATATCA CACCCGGTGG ATTCCTGTAG CTCCCATGCC AGGATGAAAT TCATTCAGGA 
2575 

ACAGGATACC AGAACTGTGA GGGTTGGACA TCAGTACACT TTCTCCAAAA CAGATTTCGA 
2635 

TTCTTGTGTT TAGAGAAGGA GTTTAAAACC CGTACCTGAG ATGCTCCCTA TACGATGGGA 
2695 

GAGCTCGGAC GGAGCACATG GGAGGAGTTC AGGCACCTCA GAGTGCACAG TGTTTACTGT 
2755 

GAAAAATTCT CGGGTTCCCT GCTCAGTAAC TTCAGCAGGA AAAACAGGGA GGTATTTGGA 
2815 

GCTTTGAACT TCTGGATTCT TGTTAGTATA CCAAATACGG AGTTACAGGA CTAACCGATT 
2875 ,, 



TCCTATATTT TTTAACCTCT GTTTTTGTCC CAGAAGTTAA AGTAAATGGT TTGGTGCTTT 
2935 

TCTCAAAAGA AAATCTCAAT GCTTTCTTTC TGCACTGTTA ATATAAGTGC CTCACTTTTT 
2995 



GTTGTTGTTG TTGTTGTTTT CTGATTTTTT TCTTTTTTTC TATCTACCTG TAACACAATA 
3055 



GGGTATGTAT TTTATATGAA ATATTTTTTA TCTTTTTTGA ATTAATATTC TTTCTGCACA 
3115 

AAGAAAGTTT CCCGAATCCC AACCTTTCTA TGACCCCGCT GTGTGTGTGC ACTACTCATC 
3175 

CTTTCCTTCA GATAAAGAGT AATTGATAAC TC 
3207 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Ser Gly Ala Asn lie Thr Tyr Ala Ser Arg Lys Arg Arg Lys 

1 5 I 10 15 

Pro Val Gin Lys Thr Val Lys Pro lie Pro Ala Glu Gly lie Lys Ser 
20 25 30 

Asn Pro Ser Lys Arg His Arg Asp Arg Leu Asn Thr Glu Leu Asp Arg 
35 40 45 

Leu Ala Ser Leu Leu Pro Phe Pro Gin Asp Val lie Asn Lys Leu Asp 
50 55 60 

Lys Leu Ser Val Leu Arg Leu Ser Val Thr Tyr Leu Arg Ala Lys Ser 
65 70 75 80 

Phe Phe Asp Val Ala Leu Lys Ser Thr Pro Ala Asp Arg Asn Gly Gly 

85 90 95 

Gin Asp Gin Cys Arg Ala Gin lie Arg Asp Trp Gin Asp Leu Gin Glu 
100 105 i 110 

I* 

Gly Glu Phe Leu Leu Gin Ala Leu Asn Gly Phe Val Leu Val Val Thr 
115 120 125 

Ala Asp Ala Leu Val Phe Tyr Ala Ser Ser Thr lie Gin Asp Tyr Leu 
130 135 140 

Gly Phe Gin Gin Ser Asp Val lie His Gin Ser Val Tyr Glu Leu He 
145 150 155 160 

His Thr Glu Asp Arg Ala Glu Phe Gin Arg Gin Leu His Trp Ala Leu 

165 170 175 

Asn Pro Asp Ser Ala Gin Gly Val Asp Glu Ala His Gly Pro Pro Gin 
180 185 190 

Ala Ala Val Tyr Tyr Thr Pro Asp Gin Leu Pro Pro Glu Asn Ala Ser 
195 200 205 

Phe Met Glu Arg Cys Phe Arg Cys Arg Leu Arg Cys Leu Leu Asp Asn 
210 215 220 , 

Ser Ser Gly Phe Leu Ala Met Asn Phe Gin Gly Arg Leu Lys Tyr Leu 
225 230 235 240 

His Gly Gin Asn Lys Lys Gly Lys Asp Gly Ala Leu Leu Pro Pro Gin 

245 250 255 

Leu Ala Leu Phe Ala He Ala Thr Pro Leu Gin Pro Pro Ser He Leu 
260 265 270 
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Glu lie Arg Thr Lys Asn Phe lie Phe Arg Thr Lys His Lys Leu Asp 
275 280 285 

Phe Thr Pro lie, Gly Cys Asp Ala Lys Gly Gin Leu lie Leu Gly Tyr 
290 295 300 

Thr Glu Val Glu Leu Cys Thr Arg Gly Ser Gly Tyr Gin Phe lie His 
305 310 315 320 

Ala Ala Asp lie Leu His Cys Ala Glu Ser His lie Arg Met lie Lys 

325 330 335 

Thr Gly Glu Ser Gly Met Thr Val Phe Arg Leu Leu Ala Lys His Ser 
340 345 350 

Arg Trp Arg Trp Val Gin Ser Asn Ala Arg Leu lie Tyr Arg Asn Gly 
355 360 365 

Arg Pro Asp Tyr lie lie Ala Thr Gin Arg Pro Leu Thr Asp Glu Glu 
370 375 380 

□ 

ffl Gly Arg Glu His Leu Gin Lys Arg Ser Thr Ser Leju Pro Phe Met Phe 

385 390 395 400 



Ala Thr Gly Glu Ala Val Leu Tyr Glu lie Ser Ser Pro Phe Ser Pro 

405 410 415 

lie Met Asp Pro Leu Pro lie Arg Thr Lys Ser Asn Thr Ser Arg Lys 
420 425 430 

Asp Trp Ala Pro Gin Ser Thr Pro Ser Lys Asp Ser Phe His Pro Ser 
435 440 445 

Ser Leu Met Ser Ala Leu lie Gin Gin Asp Glu Ser lie Tyr Leu Cys 
450 455 460 

Pro Pro Ser Ser Pro Ala Leu Leu Asp Ser His Phe Leu Met Gly Ser 
465 470 475 480 

Val Ser Lys Cys Gly Ser Trp Gin Asp Ser Phe Ala Ala Ala Gly Ser 

485 490 495 

Glu Ala Ala Leu Lys His Glu Gin He Gly His Ala Gin Asp Val Asn 
500 505 ; 510 

Leu Ala Leu Ser Gly Gly Pro Ser Glu Leu Phe Pro Asp Asn Lys Asn 
515 520 525 

Asn Asp Leu Tyr Ser He Met Arg Asn Leu Gly He Asp Phe Glu Asp 
530 535 540 

He Arg Ser Met Gin Asn Glu Glu Phe Phe Arg Thr Asp Ser Thr Ala 



r 
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545 

Ala Gly 
Thr Tyr 
Cys Gin 



Arg Leu 
610 



550 



555 



560 



Glu Val Asp 
565 

Val Gin Asp 
580 

Gin Gin Pro 
595 



Phe Lys Asp lie Asp lie Thr Asp Glu lie Leu 

570 575 

Ser Leu Asn Asn Ser Thr Leu Leu Asn Ser Ala 
585 590 

Val Thr Gin His Leu Ser Cys Met Leu Gin Glu 
600 605 



Gin Leu Glu Gin Gin Gin Gin Leu Gin Gin Pro Pro Pro Gin 

615 620 



Ala Leu 
625 

Asp Leu 



Asn Pro 



His Tyr 



Glu Pro Gin Gin Gin Leu Cys Gin Met Val Cys Pro Gin Gin 
630 635 640 



Gly Pro Lys 
645 

Thr Pro Pro 
660 

Gin Leu Phe 
675 



His Thr Gin lie Asn Gly Thr Phe Ala Ser Trp 

650 655 

Val Ser Phe Asn Cys Pro Gin Gin Glu Leu Lys 
665 j 670 

Ser Ser Leu Gin Gly Thr Ala Gin Glu Phe Pro 
680 685 



Tyr Lys Pro Glu Val Asp Ser Val Pro Tyr Thr Gin Asn Phe Ala Pro 
690 695 700 



Cys Asji 
705 

Phe Pro 
Leu Asp 
lie Asn 



Gin Pro Leu Leu Pro Glu His Ser Lys Ser Val Gin Leu Asp 
710 715 720 



Gly Arg Asp 
725 

Phe Val Ser 
740 

Ser Gin Ser 
755 



Phe Glu Pro Ser Leu His Pro Thr Thr Ser Asn 

730 735 

Cys Leu Gin Val Pro Glu Asn Gin Ser His Gly 

745 750 

Ala Met Val Ser Pro Gin Ala Tyr Tyr Ala Gly 

760 765 



Ala Met Ser Met Tyr Gin Cys Gin Pro Gly Pro Gin Arg Thr Pro Val 
770 775 780 



Asp Gin Thr Gin Tyr Ser Ser Glu lie Pro Gly Ser Gin Ala Phe Leu 
785 790 795 800 

Ser Lys Val Gin Ser 

805 



(2) INFORMATION 



FOR SEQ ID NO: 3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 383.. 2927 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AATTCCGCAC GGCCCAGACC CAGGATTCTT TATAGACGGC CCAGGCTCCT CCTCCGCCCG 
60 

GGCCGCCTCA CCTGCGGGCA TTGCGCGCCG CCTCCGCCGG TGTAGACGGC ACCTGCGCCG 
120 

CCTTGCTCGC GGGTCTCCGC CCTCGCCCAC CCTCACTGCG GCAGGCCCAG GCAGCTCACC 
180 i 

TGTACTGGCG CGGGCTGCGG AAGCTGCGTG ACGCGAGGCG TTGAGGCGCG GCGCCCACGC 
240 

CACTGTCCCG AGAGGACGCA GGTGGAGCGG GCGCGACTTC GCGAACCCGG CGCCGGCCGC 
300. 

CGCAGTGGTC CCAGCCTACA CCGGGTTCCG GGGACCCGGC CGCCAGTGCC CGGGGAGTAG 
360 

CCGCCGCCGT CGGCTGGGCA CC ATG AAC AGC AGC AGC GCC AAC ATC ACC TAG 
412 

Met Asn Ser Ser Ser Ala Asn lie Thr Tyr 
15 10 



GCC AGT CGC AAG CGG CGG AAG CCG GTG CAG AAA ACA GTA AAG CCA ATC 
460 

Ala Ser Arg Lys Arg A.rg Lys Pro Val Gin Lys Thr Val Lys Pro He 

15 20 25 

CCA GCT GAA GGA ATC AAG TCA AAT CCT TCC AAG CGG CAT AGA GAC CCA 
508 

Pro Ala Glu Gly lie Lys Ser Asn Pro Ser Lys Arg His Arg Asp Arg 
30 35 40 

CTT AAT ACA GAG TTG GAC CGT TTG GCT AGC CTG CTG CCT TTC CCA CAA 



556 

Leu Asn Thr Glu Leu 
45 

GAT GTT ATT AAT AAG 
604 

Asp Val lie Asn Lys 
60 

AGT TAG CTG AGA GCC 
652 

Ser Tyr Leu Arg Ala 
75 

COT ACT GAA AGA AAC 
700 

Pro Thr Glu Arg Asn 

95 

AGA GAA GGC CTG AAC 
748 

Arg Glu Gly Leu Asn 
110 

AAT GGC TTT GTA TTA 
796 

Asn Gly Phe Val Leu 
125 

TCT TCT ACT ATA CAA 
844 

Ser Ser Thr lie Gin 
140 

CAT CAG AGT GTA TAT 
892 

His Gin Ser Val Tyr 
155 

CAG CGT CAG CTA CAC 
940 

Gin Arg Gin Leu His 

175 

GGA CAA GGA ATT GAA 
988 

Gly Gin Gly He Glu 
190 

TAT AAC CCA GAC CAG 
1036 

Tyr Asn Pro Asp Gin 
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Asp Arg Leu Ala Ser 
50 

TTG GAC AAA CTT TCA 

Leu Asp Lys Leu Ser 
65 

AAG AGC TTC TTT GAT 

Lys Ser Phe Phe Asp 
80 

GGA GGC CAG GAT AAC 

Gly Gly Gin Asp Asn 

100 

TTA CAA GAA GGA GAA 

Leu Gin Glu Gly Glu 
115 

GTT GTC ACT ACA GAT 

Val Val Thr Thr Asp 
130 

GAT TAT CTA GGG TTT 

Asp Tyr Leu Gly Phe 
145 

GAA CTT ATC cat ACC 

Glu Leu He His Thr 
160 

TGG GCA TTA AAT CCT 

Trp Ala Leu Asn Pro 

180 

GAA GCC ACT GGT CTC 

Glu Ala Thr Gly Leu 
195 

ATT CCT CCA GAA AAC 
He Pro Pro Glu Asn 



Leu Leu Pro Phe Pro Gin 
55 

GTT CTT AGG CTC AGC GTC 

Val Leu Arg Leu Ser Val 
70 

GTT GCA TTA AAA TCC TCC 

Val Ala Leu Lys Ser Ser 
85 90 

TGT AGA GCA GCA AAT TTC 

Cys Arg Ala Ala Asn Phe 

105 

TTC TTA TTA CAG GCT CTG 

Phe Lqu Leu Gin Ala Leu 
120 

GCT TTG GTC TTT TAT GCT 

Ala Leu Val Phe Tyr Ala 
135 

CAG CAG TCT GAT GTC ATA 

Gin Gin Ser Asp Val He 
150 

GAA GAC CGA GCT GAA TTT 

Glu Asp Arg Ala Glu Phe 
165 170 

TCT CAG TGT ACA GAG TCT 

Ser Gin Cys Thr Glu Ser 

185 

CCC CAG ACA GTA GTC TGT 

Pro Gin Thr Val Val Cys 
200 

TCT CCT TTA ATG GAG AGG 
Ser Pro Leu Met Glu Arq 



205 

TGC TTC ATA TGT 
1084 

Cys Phe lie Cys 
220 

CTG GCA ATG AAT 
1132 

Leu Ala Met Asn 
235 

AAG AAA GGG AAA 

1180 
Lys Lys Gly Lys 



GCG ATA GCT ACT 
1228 

Ala He Ala Thr 
270 

AAA AAT TTT ATC 

1276 
Lys Asn Phe He 
285 

GGT TGT GAT GCC 
1324 

Gly Cys Asp Ala 
300 

CTG TGC ACG AGA 
1372 

Leu Cys Thr Arg 
315 

CTT TAT TGT GCC 

1420 
Leu Tyr Cys Ala 



GGC ATG ATA GTT 
1468 

Gly Met He Val 
350 

GTC CAG TCT AAT 
1516 

Val Gin Ser Asn 
365 



210 

CGT CTA AGG TGT 

Arg Leu Arg Cys 
225 

TTC CAA GGG AAG 

Phe Gin Gly Lys 
240 

GAT GGA TCA ATA 

Asp Gly Ser He 
255 

CCA CTT CAG CCA 
Pro Leu Gin Pro 

TTT AGA ACC AAA 

Phe Arg Thr Lys 
290 

AAA GGA AGA ATT 

Lys Gly Arg He 
305 

GGC TCA GGT TAT 

Gly Ser Gly Tyr 
320 

GAG TCC CAT ATC 

Glu Ser His He 
335 

TTC CGG CTT CTT 
Phe Arg Leu Leu 

GCA CGC CTG CTT 

Ala Arg Leu Leu 
370 



CTG CTG GAT AAT 

Leu Leu Asp Asn 
230 

TTA AAG TAT CTT 

Leu Lys Tyr Leu 
245 

CTT CCA CCT CAG 

Leu Pro Pro Gin 
260 

CCA TCC ATA CTT 

Pro Ser He Leu 
275 

. i 

CAC AAA CTA pAC 
His Lys Leu Asp 



GTT TTA GGA TAT 

Val Leu Gly Tyr 
310 

CAG TTT ATT CAT 

Gin Phe He His 
325 

CGA ATG ATT AAG 

Arg Met He Lys 
340 

ACA AAA AAC AAC 

Thr Lys Asn Asn 
355 

TAT AAA AAT GGA 
Tyr Lys Asn Gly 



215 

TCA TCT GGT TTT 
Ser Ser Gly Phe 

CAT GGA CAG AAA 

His Gly Gin Lys 
250 

TTG GCT TTG TTT 

Leu Ala Leu Phe 
265 

GAA ATC CGG ACC 

Glu He Arg Thr 
280 

TTC ACA CCT ATT 

Phe Thr Pro He 
295 

ACT GAA GCA GAG 

Thr Glu Ala Glu 

GCA GCT GAT ATG 

Ala Ala Asp Met 

330 

ACT GGA GAA AGT 

Thr Gly Glu Ser 
345 

CGA TGG ACT TOG 

Arg Trp Thr Trp 
360 

AGA CCA GAT TAT 

Arg Pro Asp Tyr 
375 
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ATC ATT GTA ACT 

1564 
He He Val Thr 
380 

TTA CGA AAA CGA 
1612 

Leu Arg Lys Arg 
395 

OCT GTG TTG TAT 

1660 
Ala Val Leu Tyr 



TTA CCA CTA AGG 

1708 
Leu Pro Leu Arg 
430 

ACA TCC ACT CTA 

1756 
Thr Ser Thr Leu 
445 

GCC ATG ATG CAA 
1804 

Ala Met Met Gin 
460 

ACT TCA AGT ACT 
1852 

Thr Ser Ser Thr 
475 

AAT GAA TGC AGA 

1900 
Asn Glu Cys Arg 



ACT ATC CTG AAA 

1948 
Thr He Leu Lys 
510 

TTT GCT GGA GGT 

1996 
Phe Ala Gly Gly 
525 

TTG TAG AGC ATA 
2044 



CAG AGA CCA CTA 

Gin Arg Pro Leu 
385 

AAT ACG AAG TTG 

Asn Thr Lys Leu 
400 

GAG GCA ACC AAC 

Glu Ala Thr Asn 
415 

ACT AAA AAT GGC 
Thr Lys Asn Gly 

AGC AAG GAC TCT 

Ser Lys Asp Ser 
450 

CAA GAT GAG TCT 

Gin Asp Glu Ser 
465 

GCA CCT TTT GAA 

Ala Pro Phe Glu 
480 

AAT TGG CAA GAT 

Asn Trp Gin Asp 
495 

CAT GAG CAA ATT 
His Glu Gin He 

CAC CCA GGG CTC 

His Pro Gly Leu 
530 

ATG AAA AAC CTA 



ACA GAT GAG GAA 

Thr Asp Glu Glu 
390 

CCT TTT ATG TTT 

Pro Phe Met Phe 
405 

CCT TTT CCT GCC 

Pro Phe Pro Ala 
420 

ACT AGT GGA AAA 

Thr Ser Gly Lys 
435 

CTC AAT CCT AGT 

i 

Leu Asn Pro Ser 

ATT TAT CTC TAT 

He Tyr Leu Tyr 
470 

AAC AAC TTT TTC 

Asn Asn Phe Phe 
485 

AAT ACT GCA CCG 

Asn Thr Ala Pro 
500 

GAC CAG CCT CAG 

Asp Gin Pro Gin 
515 

TTT CAA GAT AGT 
Phe Gin Asp Ser 

GGC ATT GAT TTT 



GGA ACA GAG CAT 
Gly Thr Glu His 

ACC ACT GGA GAA 

Thr Thr Gly Glu 
410 

ATA ATG GAT CCC 

He Met Asp Pro 
425 

GAC TCT GCT ACC 

Asp Ser Ala Thr 
440 

TCC CTC CTG GCT 

Ser Leu Leu Ala 
455 

CCT GCT TCA AGT 
Pro Ala Ser Ser 

AAC GAA TCT ATG 

Asn Glu Ser Met 
490 

ATG GGA AAT GAT 

Met Gly Asn Asp 
505 

GAT GTG AAC TCA 

Asp Val Asn Ser 
520 

AAA AAC AGT GAC 

Lys Asn Ser Asp 
535 

GAA GAC ATC AGA 
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Leu Tyr Ser lie Met Lys Asn Leu Gly lie Asp Phe Glu Asp lie Arg 
540 545 550 

CAC ATG CAG AAT GAA AAA TTT TTC AGA AAT GAT TTT TCT GGT GAG GTT 
2092 

His Met Gin Asn Glu Lys Phe Phe Arg Asn Asp Phe Ser Gly Glu Val 
555 560 565 570 

.GAC TTC AGA GAG ATT GAC TTA ACG GAT GAA ATC CTG ACG TAT GTC CAA 
2140 

Asp Phe Arg Asp lie Asp Leu Thr Asp Glu lie Leu Thr Tyr Val Gin 

575 580 585 

GAT TCT TTA AGT AAG TCT CCC TTC ATA CCT TCA GAT TAT CAA CAG CAA 
2188 

Asp Ser Leu Ser Lys Ser Pro Phe lie Pro Ser Asp Tyr Gin Gin Gin 
590 595 600 

CAG TCC TTG GCT CTG AAC TCA AGC TGT ATG GTA CAG GAA CAC CTA CAT 
2236 

Gin Ser Leu Ala Leu Asn Ser Ser Cys Met Val Gin Glu His Leu His 
605 610 i 615 

I' 

CTA GAA CAG CAA CAG CAA CAT CAC CAA AAG CAA GTA GTA GTG GAG CCA 
2284 

Leu Glu Gin Gin Gin Gin His His Gin Lys Gin Val Val Val Glu Pro 
620 625 630 

CAG CAA CAG CTG TGT CAG AAG ATG AAG CAC ATG CAA GTT AAT GGC ATG 
2332- 

Gin Gin Gin Leu Cys Gin Lys Met Lys His Met Gin Val Asn Gly Met 
635 640 645 650 

TTT GAA AAT TGG AAC TCT AAC CAA ATC GTG CCT TTC AAT TGT CCA CAG 
2380 

Phe Glu Asn Trp Asn Ser Asn Gin lie Val Pro Phe Asn Cys Pro Gin 

655 660 665 

CAA GAC CCA CAA CAA TAT AAT GTC TTT ACA GAC TTA CAT GGG ATC AGT 
2428 

Gin Asp Pro Gin Gin Tyr Asn Val Phe Thr Asp Leu His Gly lie Ser 
670 675 680 

CAA GAG TTC CCC TAC AAA TCT GAA ATG GAT TCT ATG CCT TAT ACA CAG 
2476 

Gin Glu Phe Pro Tyr Lys Ser Glu Met Asp Ser Met Pro Tyr Thr Gin 
685 690 695 

AAC TTT ATT TCC TGT AAT CAG CCT GTA TTA CCA CAA CAT TCC AAA TGT 
2524 

Asn Phe lie Ser Cys Asn Gin Pro Val Leu Pro Gin His Ser Lys Cys 

700 705 710 



I 
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ACA GAG CTG GAC TAG CCT ATG GGG AGT TTT GAA CCA TCC CCA TAC CCC 
2572 

Thr Glu Leu Asp Tyr Pro Met Gly Ser Phe Glu Pro Ser Pro Tyr Pro 
715 720 725 730 

ACT ACT TCT AGT TTA GAA GAT TTT GTC ACT TGT TTA CAA CTT CCT GAA 
2620 

Thr Thr Ser Ser Leu Glu Asp Phe Val Thr Cys Leu Gin Leu Pro Glu 

735 740 745 

AAC CAA AAG CAT GGA TTA AAT CCA CAG TCA GCC ATA ATA ACT CCT CAG 
2668 

Asn Gin Lys His Gly Leu Asn Pro Gin Ser Ala lie lie Thr Pro Gin 
750 755 760 

ACA TGT TAT GCT GGG GCC GTG TCG ATG TAT CAG TGC CAG CCA GAA CCT 
2716 

Thr Cys Tyr Ala Gly Ala Val Ser Met Tyr Gin Cys Gin Pro Glu Pro 
765 770 775 

CAG CAC ACC CAC GTG GGT CAG ATG CAG TAC AAT CCA GTA CTG CCA GGC 

2764 i 

Gin His Thr His Val Gly Gin Met Gin Tyr Asn jPro Val Leu Pro Gly 
780 785 *790 

CAA CAG GCA TTT TTA AAC AAG TTT CAG AAT GGA GTT TTA AAT GAA ACA 
2812 

Gin Gin Ala Phe Leu Asn Lys Phe Gin Asn Gly Val Leu Asn Glu Thr 
795 800 805 810 

TAT CCA GCT GAA TTA AAT AAC ATA AAT AAC ACT CAG ACT ACC ACA CAT 
2860 

Tyr Pro Ala Glu Leu Asn Asn lie Asn Asn Thr Gin Thr Thr Thr His 

815 820 825 

CTT CAG CCA CTT CAT CAT CCG TCA GAA GCC AGA CCT TTT CCT GAT TTG 
2908 

Leu Gin Pro Leu His His Pro Ser Glu Ala Arg Pro Phe Pro Asp Leu 
830 835 840 

ACA TCC AGT GGA TTC CTG T AATTCCAAGC CCAATTTTGA CCCTGGTTTT 
2957 

Thr Ser Ser Gly Phe Leu 
845 

TGGATTAAAT TAGTTTGTGA AGGATTATGG AAAAATAAAA CTGTCACTGT TGGACGTCAG 
3017 

CAAGTTCACA TGGAGGCATT GATGCATGCT ATTCACAATT ATTCCAAACC AAATTTTAAT 
3077 

TTTTGCTTTT AGAAAAGGGA GTTTAAAAAT GGTATCAAAA TTACATATAC TACAGTCAAG 



3137 

ATAGAAAGGG TGCTGCCACG 
3197 

ACCACAAAAT ATACAAAACT 
3257 

CTCTATTTGA ATTATTTCTG 
3317 

ATTCTTATTA GTTCCCCAAA 
3377 

AACCTCTGCT TTTATCTCAG 
3437 

ATCTCAGTGC TTTCCTCCTT 
3497 

AACACTCTAG GATGTATATT 
3557 

TCTGCACACA AATATTATTT 
3617 



ATTTTAACTC CACTGCTTAC 
3677 

ATAATTATTT ATTACATAAT 
3737 

GTTGAAAAAA TTTAAAAGTA 
3797 

TTAAGACAAT AGCACTTAAA 
3857 

TTTACCTTTA TTGATATCTC 
3917 

TTAACTGCAA CCTTGTTTAT 
3977 

AAAGCACCCA TTTCAATGTG 
4037 

TGATAATATA TTGTGTTGAC 
4097 

TTTTCTCATA TTTGAGGAGT 
4157 
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GAGTGGTGAG GTACCGTCTA 
TTATCAGGGA AACTAAGATT 
TCACAGTAAA AATAAAATAC 
TACAAAGTTA GAGAACTAAA 
ATGTTAAAAT AAATGGTTTG 
CACTGTTTCA TCTAAGTGCC 
TTATATAAAG TATTCTTTTT 
GTGTTTCCTA AATCCAACCA 
CTACTTTCTT CAGGTAAAAG 
TTAGTTGTTT CTAGACTATA 
AAATGTCTTT CCAAATTATT 
TTCCTCAACA GTGTTTTCAG 
CATGATGATA GTTGAATGTT 
TAAATTGCAA GAAGCTTTAT 
TATAAATTGT CTTTAAAAAC 
TTTATAAATT TCGCTTCTTA 
GTTAAGATTG CAGATAGCAA 



CATTTCACAT TATTCTGGGC 

i 
t 

CTTTTAAATT AGAAAATATT 
TTTGAGTTTT GAGCTACTGG 
CTAGTTTTTC CTATCATGTT 
GTGCTTTTTA TAAAAAGATA 
TCACATTTTT TTCTACCTAT 

CTTTTTTAAA TTAATATCTT 

i 

'i^TTTTCATTA ATTCAGGCAT 
GGCAAATAAT GATCGAAAAA 
AATGTTGCTA TGTGCCTTAT 
TCTTAATTAT TATAAAAATA 
AAGAAATAAA TATACCACTC 
GCAATGTGAA AAATCTGCTG 
TTCTAGCTTT TTAATTAAGC 
TGTTTTAGAC CTATAATCCT 
GAACAGTGGA AACTATGTGT 
GGTTTGGTGC AAAGTATTGT 
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AATGAGTGAA TTGAATGGTG CATTGTATAG 
4217 

TTGCAGTTTT TCATTTTAAA AAGTCCATAC 
4277 

TACATACTTT ATCAATGTGT CTTTCTAAGA 
4337 

TGGTATTAAT AAAAAGACAA CCACATAGTT 
4397 

TGATATACTG ATCTTCATTA CCAATAGGCA 
,4457 

AACATGGACT TTCAAAAAGA AAAAATGACA 
4517 

TTGAGAGAGA GTCTTACTCT GCCGCCCAAA 
4577 

TGCAACCTCT ACCTCCTGGG TTCAAGTGAT 
4637 

ATTGCGGGCA TGGTGGCGTG AGCCTGTAGT 
4697 

ATAGCCTGAA CCTGGGAATC GGAGGTTGCA 
4757 

TGGCAATAGA CCGAGCTCCG TCTCCAAAAA 
4817 

TTTTAGTAAG TTAATGTATA TAAAAATGGC 
4877 

TTTTCAGTCA AAACTTTAAA CCTGTAGAAT 
4937 

AACATTTCAT AATTTGTTTC CAGCATGAGG 
4997 

TAATACTCTA TTTTTACATT TAAACCTTTT 
5057 

ACTCTCTTCC ACATGTTACT GGATAAATTG 
5117 

ATATGATGAC AATCAGTTAT ACAGTTATAA 
5177 

TTTTTATCTA TATAAAATAA CTAAAATGTA 



ATATAATGAA CAAAATTATT TGTAAGATAT 

CTTATATATG CACTTAATTT GTTGGGGCTT 

AATCAAGTAA TGAATCCAAC TGCTTAAAGT 

CGTTTACCTT CAAACTTTAG GTTTTTTTAA 

AATTAATCAC CCTACCAACT TTACTGTCCT 

CCATCTTTTA TTCTTTTTTT "PTTTTTTTTT 

CTGGAGTGCA GTGGCACAAT CTTGGCTCAC 

TCTCTTGCCT Ci^GCCTCCCG AGTTGCTGGG 
!•' 

t 

i 

CCTAGCTACT CGGGAGGCTG AGGCAGGAGA 
GGGCCAAGAT CGCCCCACTG CACTCCAGCC 
AAAAAATACA ATTTTTATTT CTTTTACTTT 
TTCGGACAAA ATATCTCTGA GTTCTGTGTA 
CAATTTAAGT GTTGAAAAAA ATTTGTCTGA 
TATCTAAGGA TTTAGACCAG AGGTCTAGAT 
ATTATAAGTC TTACATAAAC CATTTTTCTT 
TTTAGTGGAA AATAGGCTTT TTAATCATGA 
AATTAAAAGT TTGAAAAGCA ATATTGTATA 
TCTAAGAATA ATAAAATCAC GTTAAACCAA 
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5237 

AAAAAAAAAA AAAAAAAAAA AAAA 
5261 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asn Ser Ser Ser Ala Asn lie Thr Tyr Ala Ser Arg Lys Arg Arg 
15 10 15 

Lys Pro Val Gin Lys Thr Val Lys Pro lie Pro Ala Glu Gly lie Lys 
20 25 J 30 

Ser Asn Pro Ser Lys Arg His Arg Asp Arg Leu iAsn Thr Glu Leu Asp 
35 40 45 

Arg Leu Ala Ser Leu Leu Pro Phe Pro Gin Asp Val lie Asn Lys Leu 
50 55 60 

Asp Lys Leu Ser Val Leu Arg Leu Ser Val Ser Tyr Leu Arg Ala Lys 
65 70 75 80 

Ser Phe Phe Asp Val Ala Leu Lys Ser Ser Pro Thr Glu Arg Asn Gly 

85 90 95 

Gly Gin Asp Asn Cys Arg Ala Ala Asn Phe Arg Glu Gly Leu Asn Leu 
100 105 110 

Gin Glu Gly Glu Phe Leu Leu Gin Ala Leu Asn Gly Phe Val Leu Val 
115 120 125 

Val Thr Thr Asp Ala Leu Val Phe Tyr Ala Ser Ser Thr lie Gin Asp 
130 135 140 

Tyr Leu Gly Phe Gin Gin Ser Asp Val lie His Gin Ser Val Tyr Glu 
145 150 155 160 

Leu lie His Thr Glu Asp Arg Ala Glu Phe Gin Arg Gin Leu His Trp 

165 170 175 

Ala Leu Asn Pro Ser Gin Cys Thr Glu Ser Gly Gin Gly lie Glu Glu 
180 185 190 
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Ala 



Thr 



Gly 



Leu 



Pro 



Gin 



Thr Val Val Cys 
200 



Tyr 



Asn 



Pro 
205 



Asp 



Gin lie 



195 



Pro Pro Glu Asn Ser Pro Leu Met Glu Arg Cys Phe lie Cys Arg Leu 
210 215 220 

Arg Cys Leu Leu Asp Asn Ser Ser Gly Phe Leu Ala Met Asn Phe Gin 
225 230 235 240 

Gly Lys Leu Lys Tyr Leu His Gly Gin Lys Lys Lys Gly Lys Asp Gly 

245 250 255 

Ser lie Leu Pro Pro Gin Leu Ala Leu Phe Ala lie Ala Thr Pro Leu 
260 265 270 

Gin Pro Pro Ser lie Leu Glu lie Arg Thr Lys Asn Phe lie Phe Arg 
275 280 285 

Thr Lys His Lys Leu Asp Phe Thr Pro lie Gly Cys Asp Ala Lys Gly 
290 295 300 

Arg lie Val Leu Gly Tyr Thr Glu Ala Glu Leu Cys Thr Arg Gly Ser 
305 310 315 I' 320 

t 
i 

Gly Tyr Gin Phe lie His Ala Ala Asp Met Leu Tyr Cys Ala Glu Ser 

325 330 335 

His He Arg Met He Lys Thr Gly Glu Ser Gly Met He Val Phe Arg 
340 345 350 

Leu Leu Thr Lys Asn Asn Arg Trp Thr Trp Val Gin Ser Asn Ala Arg 
355 360 365 

Leu Leu Tyr Lys Asn Gly Arg Pro Asp Tyr lie lie Val Thr Gin Arg 
370 375 380 

Pro Leu Thr Asp Glu Glu Gly Thr Glu His Leu Arg Lys Arg Asn Thr 
385 390 395 400 

Lys Leu Pro Phe Met Phe Thr Thr Gly Glu Ala Val Leu Tyr Glu Ala 



Thr Asn Pro Phe Pro Ala He Met Asp Pro Leu Pro Leu Arg Thr Lys 

420 425 ' 430 

Asn Gly Thr Ser Gly Lys Asp Ser Ala Thr Thr Ser Thr Leu Ser Lys 

435 440 445 

Asp Ser Leu Asn Pro Ser Ser Leu Leu Ala Ala Met Met Gin Gin Asp 

450 455 460 

Glu Ser He Tyr Leu Tyr Pro Ala Ser Ser Thr Ser Ser Thr Ala Pro 



405 



410 



415 
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465 



470 



475 



Phe Glu Asn Asn Phe Phe Asn Glu Ser Met Asn Glu 

485 490 

Gin Asp Asn Thr Ala Pro Met Gly Asn Asp Thr lie 
500 505 

Gin lie Asp Gin Pro Gin Asp Val Asn Ser Phe Ala 
515 520 



480 

Cys Arg Asn Trp 
495 

Leu Lys His Glu 
510 

Gly Gly His Pro 
525 



Gly Leu Phe Gin Asp Ser Lys Asn Ser Asp Leu Tyr Ser lie Met Lys 
530 535 540 



Asn Leu Gly lie Asp Phe Glu Asp lie Arg His Met 
545 550 555 

Phe Phe Arg Asn Asp Phe Ser Gly Glu Val Asp Phe 

565 570 

Leu Thr Asp Glu lie Leu Thr Tyr Val Gin Asp Ser 
580 585 i 

Pro Phe lie Pro Ser Asp Tyr Gin Gin Gin Gin Iser 
595 600 



Gin Asn Glu Lys 
560 

Arg Asp lie Asp 
575 

Leu Ser Lys Ser 
590 

Leu Ala Leu Asn 
605 



Ser Ser Cys Met Val Gin Glu His Leu His Leu Glu Gin Gin Gin Gin 
610 615 620 

His Hi,s Gin Lys Gin Val Val Val Glu Pro Gin Gin Gin Leu Cys Gin 
625 630 635 640 

Lys Met Lys His Met Gin Val Asn Gly Met Phe Glu Asn Trp Asn Ser 

645 650 655 



Asn Gin lie Val Pro Phe Asn Cys Pro Gin Gin Asp 
660 665 

Asn Val Phe Thr Asp Leu His Gly lie Ser Gin Glu 
675 680 

Ser Glu Met Asp Ser Met Pro Tyr Thr Gin Asn Phe 
690 695 700 



Pro Gin 
670 



Gin Tyr 
Tyr Lys 
lie Ser Cys Asn 



Phe Pro 
685 



Gin Pro Val Leu Pro Gin His Ser Lys Cys Thr Glu Leu Asp Tyr Pro 
705 710 715 720 

Met Gly Ser Phe Glu Pro Ser Pro Tyr Pro Thr Thr Ser Ser Leu Glu 

725 730 735 

Asp Phe Val Thr Cys Leu Gin Leu Pro Glu Asn Gin Lys His Gly Leu 
740 745 750 
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Asn Pro Gin Ser Ala 
755 

Val Ser Met Tyr Gin 
770 

Gin Met Gin Tyr Asn 
785 

Lys Phe Gin Asn Gly 

805 

Asn lie Asn Asn Thr 
820 

Pro Ser Glu Ala Arg 
835 



lie He 



Cys Gin 
775 

Pro Val 
790 



Thr Pro 
760 

Pro Glu 



Gin Thr Cys 



Leu Pro 



Val Leu Asn Glu 
Gin Thr 
Pro Phe 



Thr Thr 
825 



Pro Asp 
840 



Pro Gin His 
780 

Gly Gin Gin 
795 

Thr Tyr Pro 
810 

His Leu Gin 



Leu Thr Ser 



Tyr Ala Gly Ala 
765 

Thr His Val Gly 



Ala Phe Leu Asn 
800 

Ala Glu Leu Asn 
815 

Pro Leu His His 
830 

Ser Gly Phe Leu 
845 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 
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(B) LOCATION: 19 

(D) OTHER INFORMATION: /inod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 29 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 32 

(D) OTHER INFORMATION: /inod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 43 

(D) OTHER INFORMATION: /mod_base= i j 

(ix) FEATURE: i' 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 46 

(D) OTHER INFORMATION: /mod_base= i 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

TTNATNCCTC TCNGCNGGNA TNGGTCTTNA CNGTTCTTTC TGNACNGGTC TT 
52 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /mod_base= OTHER 

/n9te= "Can be either adenine, thymine, 

guanos ine, 

or cytosine." 
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(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /inod_base= OTHER 

/note= "Can be either adenine, thymine, 

guanosine, 

or cytosine." 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AAAGCCNGTN CAAGAAAGAC 
20 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

i 

(ii) MOLECULE TYPE: DNA (genomic) !• 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGATTTGACT TAATTCCTTC AGGGG 
25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CCATCGATCT CGAGAGATTG CAGATAGCAA GGTTTGGTGC 
40 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCATCGATCT CGAGTGTAAT GAGTGAATTG AATGGTGC 
38 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

GAAGATCTTC CAGTGGTCCC AGCCTACACC 
30 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

GAAGATCTTC ATGTGAACTT GCTGACGTCC 
30 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GCTCTAGATG ATCACCATGG TGCAGAAGAC CGTGAAGCCC ATCCCCGCTG AAGGAATTAA 
60 

GTC 

63 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown < 

(ii) MOLECULE TYPE: DNA (genomic) ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GCACTAGTTG ATCACCATGG CCAGCCGCAA GCGGCGCAAG CCGGTGCAGA AGACCGTGAA 
60 

GCC 

63 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GCACTAGTTG ATCACCATGA GCAGCGGCGC CAACATCACC TATGCCAGCC GCAAGCGCCG 
60 
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CAAGC 
65 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

GCAGAGTCTG GGTTTAGAGC 
20 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

TCGAGTAGAT CACGCAATGG GCCCAGC 
27 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
TCGAGCTGGG CCCATTGCGT GATCTAC 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

GCGTCGACTG GGCACCATGA ACAGCAGC 
28 

(2) INFORMAT,ION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

CCCAAGCTTA CGCGTGGTTC TCTGGAGGAA GCTGGTCTGG 
40 

(2) 'INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

CCCAAGCTTA CGCGTGGAAG TCTAGCTTGT GTTTGG 
36 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CCCAAGCTTA CGCGTGAAGC CGGAAAACTG TCATGC 
36 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown , 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CCCAAGCTTA CGCGTGCAGT GGTCTCTGAG TGGCGATGAT GTAATCTGG 
49 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CCCAAGCTTA CGCGTGGTCT TTGAAGTCAA CCTCACC 
37 

(2) INFORMATION FOR SEQ ID NO: 24: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

GAATTGTAAT ACGACTCACT ATAGGG 
26 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

CGCTCGAGAA CTAGTGGATC 
20 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

AGCTGCTTAA TTAATTAAGC A 
21 

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

AGCTTGCTTA ATTAATTAAG C 
21 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

GCGTCGACTG ATGAGCAGCG GCGCCAACAT CACC 
34 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

CATTACTTAT CTAGAGCTCG 
20 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GATTTAGGTG ACACTATAG 
19 

(2) INFORMATION FOR SEQ ID N0:3li' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

1' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GATAAGAATG CGGCCGCACG GATCCAGCAG CAACAGCAAA CAGAATTGG 
49 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATAAGAATGC GGCCGCAGCC CCCCCGACCG ATGTCAGC 
38 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

ATAGTTTAGC GGCCGCCCCA CCGTACTCGT CAATTCC 
37 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

GCCGTCGACG CGGCCGCGAA GTCTAGCTTG TGTTTGG 
37 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 

ATAAGAATGC GGCCGCACCC TCAATGTTGT GTCGGG 
36 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA. (genomic) 

I 
I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

CGGGATCCTC GCGGCCGCAG AGAATTTCAG GAATAGTGGC 
40 
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